Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 3191 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 481 |
| Duplicate rows (%) | 15.1% |
| Total size in memory | 478.1 KiB |
| Average record size in memory | 153.4 B |
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 12 |
| Dataset has 481 (15.1%) duplicate rows | Duplicates |
alcohol is highly overall correlated with density | High correlation |
chlorides is highly overall correlated with density and 4 other fields | High correlation |
density is highly overall correlated with alcohol and 2 other fields | High correlation |
fixed acidity is highly overall correlated with chlorides and 2 other fields | High correlation |
free sulfur dioxide is highly overall correlated with total sulfur dioxide | High correlation |
residual sugar is highly overall correlated with type | High correlation |
sulphates is highly overall correlated with type | High correlation |
total sulfur dioxide is highly overall correlated with chlorides and 2 other fields | High correlation |
type is highly overall correlated with chlorides and 5 other fields | High correlation |
volatile acidity is highly overall correlated with chlorides and 1 other fields | High correlation |
citric acid has 140 (4.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-30 19:33:42.336411 |
|---|---|
| Analysis finished | 2024-10-30 19:33:55.576382 |
| Duration | 13.24 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
type
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 178.9 KiB |
| Moscatel | |
|---|---|
| Syrah |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 6.5023504 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moscatel |
|---|---|
| 2nd row | Moscatel |
| 3rd row | Moscatel |
| 4th row | Moscatel |
| 5th row | Moscatel |
Common Values
| Value | Count | Frequency (%) |
| Moscatel | 1598 | |
| Syrah | 1593 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| moscatel | 1598 | |
| syrah | 1593 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3191 | |
| M | 1598 | |
| s | 1598 | |
| o | 1598 | |
| c | 1598 | |
| t | 1598 | |
| e | 1598 | |
| l | 1598 | |
| S | 1593 | |
| y | 1593 | |
| Other values (2) | 3186 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20749 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 3191 | |
| M | 1598 | |
| s | 1598 | |
| o | 1598 | |
| c | 1598 | |
| t | 1598 | |
| e | 1598 | |
| l | 1598 | |
| S | 1593 | |
| y | 1593 | |
| Other values (2) | 3186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20749 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 3191 | |
| M | 1598 | |
| s | 1598 | |
| o | 1598 | |
| c | 1598 | |
| t | 1598 | |
| e | 1598 | |
| l | 1598 | |
| S | 1593 | |
| y | 1593 | |
| Other values (2) | 3186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20749 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 3191 | |
| M | 1598 | |
| s | 1598 | |
| o | 1598 | |
| c | 1598 | |
| t | 1598 | |
| e | 1598 | |
| l | 1598 | |
| S | 1593 | |
| y | 1593 | |
| Other values (2) | 3186 |
fixed acidity
Real number (ℝ)
High correlation 
| Distinct | 100 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4250078 |
| Minimum | 3.8 |
|---|---|
| Maximum | 15.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 5.6 |
| Q1 | 6.4 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10.7 |
| Maximum | 15.9 |
| Range | 12.1 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.5984636 |
|---|---|
| Coefficient of variation (CV) | 0.21528107 |
| Kurtosis | 2.7575382 |
| Mean | 7.4250078 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 1.4700301 |
| Sum | 23693.2 |
| Variance | 2.555086 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.6 | 155 | 4.9% |
| 6.8 | 151 | 4.7% |
| 6.4 | 144 | 4.5% |
| 6.7 | 119 | 3.7% |
| 7.2 | 116 | 3.6% |
| 7 | 114 | 3.6% |
| 6 | 114 | 3.6% |
| 7.1 | 111 | 3.5% |
| 6.9 | 103 | 3.2% |
| 6.5 | 101 | 3.2% |
| Other values (90) | 1963 |
| Value | Count | Frequency (%) |
| 3.8 | 1 | < 0.1% |
| 3.9 | 1 | < 0.1% |
| 4.4 | 3 | 0.1% |
| 4.6 | 1 | < 0.1% |
| 4.7 | 6 | 0.2% |
| 4.8 | 7 | 0.2% |
| 4.9 | 5 | 0.2% |
| 5 | 19 | |
| 5.1 | 14 | |
| 5.2 | 16 |
| Value | Count | Frequency (%) |
| 15.9 | 1 | |
| 15.6 | 2 | |
| 15.5 | 2 | |
| 15 | 2 | |
| 14.3 | 1 | |
| 14 | 1 | |
| 13.8 | 1 | |
| 13.7 | 2 | |
| 13.5 | 1 | |
| 13.4 | 1 |
volatile acidity
Real number (ℝ)
High correlation 
| Distinct | 171 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.40464275 |
| Minimum | 0.085 |
|---|---|
| Maximum | 1.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0.085 |
|---|---|
| 5-th percentile | 0.17 |
| Q1 | 0.26 |
| median | 0.36 |
| Q3 | 0.53 |
| 95-th percentile | 0.7475 |
| Maximum | 1.58 |
| Range | 1.495 |
| Interquartile range (IQR) | 0.27 |
Descriptive statistics
| Standard deviation | 0.18965419 |
|---|---|
| Coefficient of variation (CV) | 0.4686954 |
| Kurtosis | 1.0761404 |
| Mean | 0.40464275 |
| Median Absolute Deviation (MAD) | 0.12 |
| Skewness | 0.99709392 |
| Sum | 1291.215 |
| Variance | 0.035968712 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.28 | 120 | 3.8% |
| 0.24 | 102 | 3.2% |
| 0.22 | 98 | 3.1% |
| 0.27 | 89 | 2.8% |
| 0.26 | 87 | 2.7% |
| 0.32 | 87 | 2.7% |
| 0.3 | 85 | 2.7% |
| 0.31 | 85 | 2.7% |
| 0.36 | 79 | 2.5% |
| 0.38 | 73 | 2.3% |
| Other values (161) | 2286 |
| Value | Count | Frequency (%) |
| 0.085 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.105 | 4 | 0.1% |
| 0.11 | 5 | 0.2% |
| 0.12 | 10 | 0.3% |
| 0.13 | 8 | 0.3% |
| 0.14 | 15 | 0.5% |
| 0.145 | 2 | 0.1% |
| 0.15 | 31 | |
| 0.16 | 48 |
| Value | Count | Frequency (%) |
| 1.58 | 1 | |
| 1.33 | 2 | |
| 1.24 | 1 | |
| 1.185 | 1 | |
| 1.18 | 1 | |
| 1.13 | 1 | |
| 1.115 | 1 | |
| 1.1 | 1 | |
| 1.09 | 1 | |
| 1.07 | 1 |
citric acid
Real number (ℝ)
Zeros 
| Distinct | 83 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.28797242 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 140 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.01 |
| Q1 | 0.2 |
| median | 0.28 |
| Q3 | 0.37 |
| 95-th percentile | 0.56 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.17 |
Descriptive statistics
| Standard deviation | 0.15735966 |
|---|---|
| Coefficient of variation (CV) | 0.54644004 |
| Kurtosis | 0.42839103 |
| Mean | 0.28797242 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 0.3139611 |
| Sum | 918.92 |
| Variance | 0.024762063 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.28 | 155 | 4.9% |
| 0.3 | 143 | 4.5% |
| 0 | 140 | 4.4% |
| 0.26 | 130 | 4.1% |
| 0.32 | 126 | 3.9% |
| 0.27 | 123 | 3.9% |
| 0.24 | 115 | 3.6% |
| 0.29 | 111 | 3.5% |
| 0.25 | 91 | 2.9% |
| 0.33 | 87 | 2.7% |
| Other values (73) | 1970 |
| Value | Count | Frequency (%) |
| 0 | 140 | |
| 0.01 | 37 | 1.2% |
| 0.02 | 52 | 1.6% |
| 0.03 | 30 | 0.9% |
| 0.04 | 32 | 1.0% |
| 0.05 | 21 | 0.7% |
| 0.06 | 26 | 0.8% |
| 0.07 | 22 | 0.7% |
| 0.08 | 33 | 1.0% |
| 0.09 | 37 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 2 | 0.1% |
| 0.91 | 2 | 0.1% |
| 0.86 | 1 | < 0.1% |
| 0.82 | 1 | < 0.1% |
| 0.79 | 2 | 0.1% |
| 0.78 | 2 | 0.1% |
| 0.76 | 3 | |
| 0.75 | 1 | < 0.1% |
| 0.74 | 5 | |
| 0.73 | 4 |
residual sugar
Real number (ℝ)
High correlation 
| Distinct | 234 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5203855 |
| Minimum | 0.7 |
|---|---|
| Maximum | 26.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0.7 |
|---|---|
| 5-th percentile | 1.2 |
| Q1 | 1.9 |
| median | 2.5 |
| Q3 | 6.05 |
| 95-th percentile | 13.8 |
| Maximum | 26.05 |
| Range | 25.35 |
| Interquartile range (IQR) | 4.15 |
Descriptive statistics
| Standard deviation | 4.1507944 |
|---|---|
| Coefficient of variation (CV) | 0.91823905 |
| Kurtosis | 2.1537261 |
| Mean | 4.5203855 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | 1.6779611 |
| Sum | 14424.55 |
| Variance | 17.229094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 187 | 5.9% |
| 1.8 | 155 | 4.9% |
| 2.2 | 144 | 4.5% |
| 2.1 | 140 | 4.4% |
| 1.9 | 139 | 4.4% |
| 2.3 | 122 | 3.8% |
| 2.4 | 104 | 3.3% |
| 2.5 | 103 | 3.2% |
| 1.6 | 96 | 3.0% |
| 1.7 | 94 | 2.9% |
| Other values (224) | 1907 |
| Value | Count | Frequency (%) |
| 0.7 | 2 | 0.1% |
| 0.8 | 5 | 0.2% |
| 0.9 | 16 | 0.5% |
| 1 | 29 | 0.9% |
| 1.1 | 52 | |
| 1.15 | 1 | < 0.1% |
| 1.2 | 74 | |
| 1.3 | 55 | |
| 1.4 | 84 | |
| 1.45 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 26.05 | 2 | |
| 22.6 | 1 | < 0.1% |
| 20.8 | 1 | < 0.1% |
| 20.3 | 1 | < 0.1% |
| 20.15 | 1 | < 0.1% |
| 19.95 | 1 | < 0.1% |
| 19.9 | 1 | < 0.1% |
| 19.5 | 1 | < 0.1% |
| 19.4 | 1 | < 0.1% |
| 19.3 | 3 |
chlorides
Real number (ℝ)
High correlation 
| Distinct | 192 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.066371357 |
| Minimum | 0.009 |
|---|---|
| Maximum | 0.611 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0.009 |
|---|---|
| 5-th percentile | 0.03 |
| Q1 | 0.042 |
| median | 0.058 |
| Q3 | 0.08 |
| 95-th percentile | 0.114 |
| Maximum | 0.611 |
| Range | 0.602 |
| Interquartile range (IQR) | 0.038 |
Descriptive statistics
| Standard deviation | 0.042076945 |
|---|---|
| Coefficient of variation (CV) | 0.6339624 |
| Kurtosis | 41.614527 |
| Mean | 0.066371357 |
| Median Absolute Deviation (MAD) | 0.019 |
| Skewness | 5.0140462 |
| Sum | 211.791 |
| Variance | 0.0017704693 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.036 | 76 | 2.4% |
| 0.048 | 75 | 2.4% |
| 0.044 | 74 | 2.3% |
| 0.05 | 69 | 2.2% |
| 0.08 | 66 | 2.1% |
| 0.047 | 63 | 2.0% |
| 0.042 | 61 | 1.9% |
| 0.041 | 56 | 1.8% |
| 0.076 | 55 | 1.7% |
| 0.035 | 55 | 1.7% |
| Other values (182) | 2541 |
| Value | Count | Frequency (%) |
| 0.009 | 1 | < 0.1% |
| 0.012 | 2 | 0.1% |
| 0.013 | 1 | < 0.1% |
| 0.014 | 2 | 0.1% |
| 0.015 | 4 | |
| 0.016 | 1 | < 0.1% |
| 0.017 | 3 | |
| 0.018 | 4 | |
| 0.019 | 1 | < 0.1% |
| 0.02 | 6 |
| Value | Count | Frequency (%) |
| 0.611 | 1 | < 0.1% |
| 0.61 | 1 | < 0.1% |
| 0.467 | 1 | < 0.1% |
| 0.464 | 1 | < 0.1% |
| 0.422 | 1 | < 0.1% |
| 0.415 | 3 | |
| 0.414 | 2 | |
| 0.413 | 1 | < 0.1% |
| 0.403 | 1 | < 0.1% |
| 0.401 | 1 | < 0.1% |
free sulfur dioxide
Real number (ℝ)
High correlation 
| Distinct | 99 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.528048 |
| Minimum | 1 |
|---|---|
| Maximum | 289 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 12 |
| median | 23 |
| Q3 | 35 |
| 95-th percentile | 57 |
| Maximum | 289 |
| Range | 288 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 17.408059 |
|---|---|
| Coefficient of variation (CV) | 0.68191893 |
| Kurtosis | 17.506675 |
| Mean | 25.528048 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 2.0366479 |
| Sum | 81460 |
| Variance | 303.04051 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 148 | 4.6% |
| 5 | 110 | 3.4% |
| 15 | 98 | 3.1% |
| 10 | 94 | 2.9% |
| 17 | 91 | 2.9% |
| 12 | 90 | 2.8% |
| 16 | 83 | 2.6% |
| 26 | 81 | 2.5% |
| 29 | 79 | 2.5% |
| 7 | 79 | 2.5% |
| Other values (89) | 2238 |
| Value | Count | Frequency (%) |
| 1 | 3 | 0.1% |
| 2 | 2 | 0.1% |
| 3 | 51 | 1.6% |
| 4 | 43 | 1.3% |
| 5 | 110 | |
| 5.5 | 1 | < 0.1% |
| 6 | 148 | |
| 7 | 79 | |
| 8 | 63 | |
| 9 | 67 |
| Value | Count | Frequency (%) |
| 289 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| 112 | 1 | < 0.1% |
| 108 | 3 | |
| 105 | 2 | |
| 101 | 2 | |
| 98 | 3 | |
| 97 | 1 | < 0.1% |
| 87 | 2 | |
| 81 | 3 |
total sulfur dioxide
Real number (ℝ)
High correlation 
| Distinct | 228 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.897681 |
| Minimum | 6 |
|---|---|
| Maximum | 440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 38 |
| median | 88 |
| Q3 | 127 |
| 95-th percentile | 183 |
| Maximum | 440 |
| Range | 434 |
| Interquartile range (IQR) | 89 |
Descriptive statistics
| Standard deviation | 54.620972 |
|---|---|
| Coefficient of variation (CV) | 0.62141539 |
| Kurtosis | -0.27048547 |
| Mean | 87.897681 |
| Median Absolute Deviation (MAD) | 45 |
| Skewness | 0.41764604 |
| Sum | 280481.5 |
| Variance | 2983.4506 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 111 | 41 | 1.3% |
| 28 | 41 | 1.3% |
| 113 | 38 | 1.2% |
| 24 | 36 | 1.1% |
| 15 | 35 | 1.1% |
| 18 | 35 | 1.1% |
| 122 | 34 | 1.1% |
| 23 | 34 | 1.1% |
| 20 | 33 | 1.0% |
| 14 | 33 | 1.0% |
| Other values (218) | 2831 |
| Value | Count | Frequency (%) |
| 6 | 3 | 0.1% |
| 7 | 4 | 0.1% |
| 8 | 14 | 0.4% |
| 9 | 15 | |
| 10 | 28 | |
| 11 | 26 | |
| 12 | 29 | |
| 13 | 28 | |
| 14 | 33 | |
| 15 | 35 |
| Value | Count | Frequency (%) |
| 440 | 1 | |
| 289 | 1 | |
| 278 | 1 | |
| 259 | 1 | |
| 251 | 1 | |
| 248 | 2 | |
| 243 | 1 | |
| 240 | 1 | |
| 237 | 2 | |
| 230 | 1 |
density
Real number (ℝ)
High correlation 
| Distinct | 862 |
|---|---|
| Distinct (%) | 27.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9128882 |
| Minimum | 0.98711 |
|---|---|
| Maximum | 100.369 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0.98711 |
|---|---|
| 5-th percentile | 0.98963 |
| Q1 | 0.99259 |
| median | 0.99551 |
| Q3 | 0.99726 |
| 95-th percentile | 0.9994 |
| Maximum | 100.369 |
| Range | 99.38189 |
| Interquartile range (IQR) | 0.00467 |
Descriptive statistics
| Standard deviation | 8.8018583 |
|---|---|
| Coefficient of variation (CV) | 4.6013449 |
| Kurtosis | 118.81552 |
| Mean | 1.9128882 |
| Median Absolute Deviation (MAD) | 0.00209 |
| Skewness | 10.91168 |
| Sum | 6104.0261 |
| Variance | 77.472709 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9976 | 37 | 1.2% |
| 0.9968 | 37 | 1.2% |
| 0.9972 | 37 | 1.2% |
| 0.9984 | 35 | 1.1% |
| 0.998 | 32 | 1.0% |
| 0.9964 | 29 | 0.9% |
| 0.9962 | 28 | 0.9% |
| 0.9978 | 27 | 0.8% |
| 0.997 | 27 | 0.8% |
| 0.9974 | 25 | 0.8% |
| Other values (852) | 2877 |
| Value | Count | Frequency (%) |
| 0.98711 | 1 | |
| 0.98722 | 1 | |
| 0.9874 | 1 | |
| 0.98742 | 2 | |
| 0.98746 | 2 | |
| 0.98758 | 1 | |
| 0.98774 | 1 | |
| 0.98779 | 1 | |
| 0.98794 | 2 | |
| 0.98816 | 1 |
| Value | Count | Frequency (%) |
| 100.369 | 2 | |
| 100.315 | 3 | |
| 100.295 | 2 | |
| 100.289 | 1 | < 0.1% |
| 100.242 | 2 | |
| 100.196 | 1 | < 0.1% |
| 100.044 | 2 | |
| 100.038 | 2 | |
| 100.037 | 2 | |
| 100.025 | 1 | < 0.1% |
pH
Real number (ℝ)
| Distinct | 98 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2363115 |
| Minimum | 2.74 |
|---|---|
| Maximum | 4.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 2.74 |
|---|---|
| 5-th percentile | 2.97 |
| Q1 | 3.12 |
| median | 3.23 |
| Q3 | 3.35 |
| 95-th percentile | 3.52 |
| Maximum | 4.01 |
| Range | 1.27 |
| Interquartile range (IQR) | 0.23 |
Descriptive statistics
| Standard deviation | 0.16505503 |
|---|---|
| Coefficient of variation (CV) | 0.05100097 |
| Kurtosis | 0.36148226 |
| Mean | 3.2363115 |
| Median Absolute Deviation (MAD) | 0.11 |
| Skewness | 0.29060474 |
| Sum | 10327.07 |
| Variance | 0.027243162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.16 | 99 | 3.1% |
| 3.26 | 96 | 3.0% |
| 3.22 | 90 | 2.8% |
| 3.2 | 88 | 2.8% |
| 3.36 | 83 | 2.6% |
| 3.24 | 83 | 2.6% |
| 3.18 | 82 | 2.6% |
| 3.3 | 78 | 2.4% |
| 3.14 | 78 | 2.4% |
| 3.15 | 78 | 2.4% |
| Other values (88) | 2336 |
| Value | Count | Frequency (%) |
| 2.74 | 1 | < 0.1% |
| 2.79 | 1 | < 0.1% |
| 2.8 | 1 | < 0.1% |
| 2.82 | 1 | < 0.1% |
| 2.83 | 4 | 0.1% |
| 2.85 | 3 | 0.1% |
| 2.86 | 8 | |
| 2.87 | 4 | 0.1% |
| 2.88 | 11 | |
| 2.89 | 5 |
| Value | Count | Frequency (%) |
| 4.01 | 2 | |
| 3.9 | 2 | |
| 3.85 | 1 | < 0.1% |
| 3.78 | 2 | |
| 3.76 | 1 | < 0.1% |
| 3.75 | 3 | |
| 3.74 | 1 | < 0.1% |
| 3.72 | 3 | |
| 3.71 | 4 | |
| 3.7 | 1 | < 0.1% |
sulphates
Real number (ℝ)
High correlation 
| Distinct | 110 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.57401128 |
| Minimum | 0.23 |
|---|---|
| Maximum | 2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 0.23 |
|---|---|
| 5-th percentile | 0.36 |
| Q1 | 0.47 |
| median | 0.55 |
| Q3 | 0.65 |
| 95-th percentile | 0.86 |
| Maximum | 2 |
| Range | 1.77 |
| Interquartile range (IQR) | 0.18 |
Descriptive statistics
| Standard deviation | 0.16666078 |
|---|---|
| Coefficient of variation (CV) | 0.29034409 |
| Kurtosis | 8.8869131 |
| Mean | 0.57401128 |
| Median Absolute Deviation (MAD) | 0.09 |
| Skewness | 1.874581 |
| Sum | 1831.67 |
| Variance | 0.027775817 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.54 | 126 | 3.9% |
| 0.5 | 120 | 3.8% |
| 0.56 | 115 | 3.6% |
| 0.58 | 102 | 3.2% |
| 0.6 | 101 | 3.2% |
| 0.52 | 101 | 3.2% |
| 0.53 | 101 | 3.2% |
| 0.48 | 94 | 2.9% |
| 0.57 | 87 | 2.7% |
| 0.49 | 86 | 2.7% |
| Other values (100) | 2158 |
| Value | Count | Frequency (%) |
| 0.23 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.26 | 3 | 0.1% |
| 0.27 | 6 | 0.2% |
| 0.28 | 2 | 0.1% |
| 0.29 | 5 | 0.2% |
| 0.3 | 9 | |
| 0.31 | 15 | |
| 0.32 | 11 | |
| 0.33 | 17 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 1.98 | 1 | < 0.1% |
| 1.95 | 2 | |
| 1.62 | 1 | < 0.1% |
| 1.61 | 1 | < 0.1% |
| 1.59 | 1 | < 0.1% |
| 1.56 | 1 | < 0.1% |
| 1.36 | 3 | |
| 1.34 | 1 | < 0.1% |
| 1.33 | 1 | < 0.1% |
alcohol
Real number (ℝ)
High correlation 
| Distinct | 85 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.630809 |
| Minimum | 8.4 |
|---|---|
| Maximum | 14.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 8.4 |
|---|---|
| 5-th percentile | 9.1 |
| Q1 | 9.5 |
| median | 10.5 |
| Q3 | 11.4 |
| 95-th percentile | 12.8 |
| Maximum | 14.9 |
| Range | 6.5 |
| Interquartile range (IQR) | 1.9 |
Descriptive statistics
| Standard deviation | 1.2125479 |
|---|---|
| Coefficient of variation (CV) | 0.1140598 |
| Kurtosis | -0.57885207 |
| Mean | 10.630809 |
| Median Absolute Deviation (MAD) | 0.95 |
| Skewness | 0.534312 |
| Sum | 33922.91 |
| Variance | 1.4702725 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.5 | 200 | 6.3% |
| 9.4 | 170 | 5.3% |
| 9.2 | 131 | 4.1% |
| 11 | 120 | 3.8% |
| 9.8 | 117 | 3.7% |
| 10.5 | 107 | 3.4% |
| 10 | 98 | 3.1% |
| 11.2 | 90 | 2.8% |
| 9.6 | 89 | 2.8% |
| 10.4 | 87 | 2.7% |
| Other values (75) | 1982 |
| Value | Count | Frequency (%) |
| 8.4 | 5 | 0.2% |
| 8.5 | 5 | 0.2% |
| 8.6 | 2 | 0.1% |
| 8.7 | 19 | 0.6% |
| 8.8 | 33 | 1.0% |
| 8.9 | 16 | 0.5% |
| 9 | 68 | |
| 9.05 | 1 | < 0.1% |
| 9.1 | 71 | |
| 9.2 | 131 |
| Value | Count | Frequency (%) |
| 14.9 | 1 | < 0.1% |
| 14.2 | 1 | < 0.1% |
| 14.05 | 1 | < 0.1% |
| 14 | 9 | |
| 13.9 | 2 | 0.1% |
| 13.8 | 2 | 0.1% |
| 13.7 | 3 | 0.1% |
| 13.6 | 13 | |
| 13.55 | 1 | < 0.1% |
| 13.5 | 6 |
quality
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7828267 |
| Minimum | 3 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 178.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 8 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.83054896 |
|---|---|
| Coefficient of variation (CV) | 0.14362335 |
| Kurtosis | 0.23399302 |
| Mean | 5.7828267 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.16061177 |
| Sum | 18453 |
| Variance | 0.68981157 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 1432 | |
| 5 | 1094 | |
| 7 | 491 | 15.4% |
| 4 | 92 | 2.9% |
| 8 | 68 | 2.1% |
| 3 | 14 | 0.4% |
| Value | Count | Frequency (%) |
| 3 | 14 | 0.4% |
| 4 | 92 | 2.9% |
| 5 | 1094 | |
| 6 | 1432 | |
| 7 | 491 | 15.4% |
| 8 | 68 | 2.1% |
| Value | Count | Frequency (%) |
| 8 | 68 | 2.1% |
| 7 | 491 | 15.4% |
| 6 | 1432 | |
| 5 | 1094 | |
| 4 | 92 | 2.9% |
| 3 | 14 | 0.4% |
Interactions
Correlations
| alcohol | chlorides | citric acid | density | fixed acidity | free sulfur dioxide | pH | quality | residual sugar | sulphates | total sulfur dioxide | type | volatile acidity | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| alcohol | 1.000 | -0.399 | 0.087 | -0.653 | -0.208 | -0.028 | 0.116 | 0.475 | -0.161 | -0.016 | -0.099 | 0.257 | -0.139 |
| chlorides | -0.399 | 1.000 | -0.045 | 0.653 | 0.579 | -0.444 | 0.257 | -0.311 | -0.176 | 0.457 | -0.519 | 0.742 | 0.576 |
| citric acid | 0.087 | -0.045 | 1.000 | 0.086 | 0.286 | 0.068 | -0.351 | 0.174 | 0.074 | 0.143 | 0.111 | 0.486 | -0.401 |
| density | -0.653 | 0.653 | 0.086 | 1.000 | 0.608 | -0.254 | 0.072 | -0.299 | 0.274 | 0.395 | -0.273 | 0.000 | 0.376 |
| fixed acidity | -0.208 | 0.579 | 0.286 | 0.608 | 1.000 | -0.426 | -0.125 | -0.118 | -0.119 | 0.409 | -0.474 | 0.600 | 0.336 |
| free sulfur dioxide | -0.028 | -0.444 | 0.068 | -0.254 | -0.426 | 1.000 | -0.263 | 0.109 | 0.330 | -0.318 | 0.801 | 0.495 | -0.449 |
| pH | 0.116 | 0.257 | -0.351 | 0.072 | -0.125 | -0.263 | 1.000 | -0.074 | -0.287 | 0.276 | -0.383 | 0.465 | 0.416 |
| quality | 0.475 | -0.311 | 0.174 | -0.299 | -0.118 | 0.109 | -0.074 | 1.000 | 0.065 | 0.046 | 0.015 | 0.192 | -0.341 |
| residual sugar | -0.161 | -0.176 | 0.074 | 0.274 | -0.119 | 0.330 | -0.287 | 0.065 | 1.000 | -0.184 | 0.423 | 0.551 | -0.188 |
| sulphates | -0.016 | 0.457 | 0.143 | 0.395 | 0.409 | -0.318 | 0.276 | 0.046 | -0.184 | 1.000 | -0.423 | 0.522 | 0.302 |
| total sulfur dioxide | -0.099 | -0.519 | 0.111 | -0.273 | -0.474 | 0.801 | -0.383 | 0.015 | 0.423 | -0.423 | 1.000 | 0.794 | -0.497 |
| type | 0.257 | 0.742 | 0.486 | 0.000 | 0.600 | 0.495 | 0.465 | 0.192 | 0.551 | 0.522 | 0.794 | 1.000 | 0.683 |
| volatile acidity | -0.139 | 0.576 | -0.401 | 0.376 | 0.336 | -0.449 | 0.416 | -0.341 | -0.188 | 0.302 | -0.497 | 0.683 | 1.000 |
Missing values
Sample
| type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Moscatel | 8.1 | 0.24 | 0.32 | 10.5 | 0.030 | 34.0 | 105.0 | 0.99407 | 3.11 | 0.42 | 11.8 | 6 |
| 1 | Moscatel | 5.8 | 0.23 | 0.20 | 2.0 | 0.043 | 39.0 | 154.0 | 0.99226 | 3.21 | 0.39 | 10.2 | 6 |
| 2 | Moscatel | 7.5 | 0.33 | 0.36 | 2.6 | 0.051 | 26.0 | 126.0 | 0.99097 | 3.32 | 0.53 | 12.7 | 6 |
| 3 | Moscatel | 6.6 | 0.38 | 0.36 | 9.2 | 0.061 | 42.0 | 214.0 | 0.99760 | 3.31 | 0.56 | 9.4 | 5 |
| 4 | Moscatel | 6.4 | 0.15 | 0.29 | 1.8 | 0.044 | 21.0 | 115.0 | 0.99166 | 3.10 | 0.38 | 10.2 | 5 |
| 5 | Moscatel | 6.5 | 0.32 | 0.34 | 5.7 | 0.044 | 27.0 | 91.0 | 0.99184 | 3.28 | 0.60 | 12.0 | 7 |
| 6 | Moscatel | 7.5 | 0.22 | 0.32 | 2.4 | 0.045 | 29.0 | 100.0 | 0.99135 | 3.08 | 0.60 | 11.3 | 7 |
| 7 | Moscatel | 6.4 | 0.23 | 0.32 | 1.9 | 0.038 | 40.0 | 118.0 | 0.99074 | 3.32 | 0.53 | 11.8 | 7 |
| 8 | Moscatel | 6.1 | 0.22 | 0.31 | 1.4 | 0.039 | 40.0 | 129.0 | 0.99193 | 3.45 | 0.59 | 10.9 | 5 |
| 9 | Moscatel | 6.5 | 0.48 | 0.02 | 0.9 | 0.043 | 32.0 | 99.0 | 0.99226 | 3.14 | 0.47 | 9.8 | 4 |
| type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3221 | Syrah | 6.6 | 0.725 | 0.20 | 7.8 | 0.073 | 29.0 | 79.0 | 0.99770 | 3.29 | 0.54 | 9.2 | 5 |
| 3222 | Syrah | 6.3 | 0.550 | 0.15 | 1.8 | 0.077 | 26.0 | 35.0 | 0.99314 | 3.32 | 0.82 | 11.6 | 6 |
| 3223 | Syrah | 5.4 | 0.740 | 0.09 | 1.7 | 0.089 | 16.0 | 26.0 | 0.99402 | 3.67 | 0.56 | 11.6 | 6 |
| 3224 | Syrah | 6.3 | 0.510 | 0.13 | 2.3 | 0.076 | 29.0 | 40.0 | 0.99574 | 3.42 | 0.75 | 11.0 | 6 |
| 3225 | Syrah | 6.8 | 0.620 | 0.08 | 1.9 | 0.068 | 28.0 | 38.0 | 0.99651 | 3.42 | 0.82 | 9.5 | 6 |
| 3226 | Syrah | 6.2 | 0.600 | 0.08 | 2.0 | 0.090 | 32.0 | 44.0 | 0.99490 | 3.45 | 0.58 | 10.5 | 5 |
| 3227 | Syrah | 5.9 | 0.550 | 0.10 | 2.2 | 0.062 | 39.0 | 51.0 | 0.99512 | 3.52 | 0.76 | 11.2 | 6 |
| 3228 | Syrah | 6.3 | 0.510 | 0.13 | 2.3 | 0.076 | 29.0 | 40.0 | 0.99574 | 3.42 | 0.75 | 11.0 | 6 |
| 3229 | Syrah | 5.9 | 0.645 | 0.12 | 2.0 | 0.075 | 32.0 | 44.0 | 0.99547 | 3.57 | 0.71 | 10.2 | 5 |
| 3230 | Syrah | 6.0 | 0.310 | 0.47 | 3.6 | 0.067 | 18.0 | 42.0 | 0.99549 | 3.39 | 0.66 | 11.0 | 6 |
Duplicate rows
Most frequently occurring
| type | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 192 | Moscatel | 7.0 | 0.15 | 0.28 | 14.70 | 0.051 | 29.0 | 149.0 | 0.99792 | 2.96 | 0.39 | 9.0 | 7 | 8 |
| 227 | Moscatel | 7.3 | 0.19 | 0.27 | 13.90 | 0.057 | 45.0 | 155.0 | 0.99807 | 2.94 | 0.41 | 8.8 | 8 | 8 |
| 233 | Moscatel | 7.4 | 0.16 | 0.30 | 13.70 | 0.056 | 33.0 | 168.0 | 0.99825 | 2.90 | 0.44 | 8.7 | 7 | 7 |
| 232 | Moscatel | 7.4 | 0.16 | 0.27 | 15.50 | 0.050 | 25.0 | 135.0 | 0.99840 | 2.90 | 0.43 | 8.7 | 7 | 6 |
| 12 | Moscatel | 5.7 | 0.22 | 0.20 | 16.00 | 0.044 | 41.0 | 113.0 | 0.99862 | 3.22 | 0.46 | 8.9 | 6 | 5 |
| 123 | Moscatel | 6.6 | 0.22 | 0.23 | 17.30 | 0.047 | 37.0 | 118.0 | 0.99906 | 3.08 | 0.46 | 8.8 | 6 | 5 |
| 140 | Moscatel | 6.7 | 0.16 | 0.32 | 12.50 | 0.035 | 18.0 | 156.0 | 0.99666 | 2.88 | 0.36 | 9.0 | 6 | 5 |
| 239 | Moscatel | 7.5 | 0.24 | 0.31 | 13.10 | 0.050 | 26.0 | 180.0 | 0.99884 | 3.05 | 0.53 | 9.1 | 6 | 5 |
| 13 | Moscatel | 5.7 | 0.22 | 0.22 | 16.65 | 0.044 | 39.0 | 110.0 | 0.99855 | 3.24 | 0.48 | 9.0 | 6 | 4 |
| 27 | Moscatel | 6.0 | 0.20 | 0.26 | 6.80 | 0.049 | 22.0 | 93.0 | 0.99280 | 3.15 | 0.42 | 11.0 | 6 | 4 |